EMPIRICAL NULL AND FALSE DISCOVERY RATE INFERENCE FOR EXPONENTIAL FAMILIES BY ARMIN SCHWARTZMAN1 Harvard School of Public Health and Dana-Farber Cancer Institute
نویسنده
چکیده
In large scale multiple testing, the use of an empirical null distribution rather than the theoretical null distribution can be critical for correct inference. This paper proposes a “mode matching” method for fitting an empirical null when the theoretical null belongs to any exponential family. Based on the central matching method for z-scores, mode matching estimates the null density by fitting an appropriate exponential family to the histogram of the test statistics by Poisson regression in a region surrounding the mode. The empirical null estimate is then used to estimate local and tail false discovery rate (FDR) for inference. Delta-method covariance formulas and approximate asymptotic bias formulas are provided, as well as simulation studies of the effect of the tuning parameters of the procedure on the bias-variance trade-off. The standard FDR estimates are found to be biased down at the far tails. Correlation between test statistics is taken into account in the covariance estimates, providing a generalization of Efron’s “wing function” for exponential families. Applications with χ2 statistics are shown in a family-based genomewide association study from the Framingham Heart Study and an anatomical brain imaging study of dyslexia in children.
منابع مشابه
Empirical Null and False Discovery Rate Inference for Exponential Families
In large scale multiple testing, the use of an empirical null distribution rather than the theoretical null distribution can be critical for correct inference. This paper proposes a “mode matching” method for fitting an empirical null when the theoretical null belongs to any exponential family. Based on the central matching method for z-scores, mode matching estimates the null density by fittin...
متن کاملMultiple Testing of Submatrices of a Precision Matrix with Applications to Identification of Between Pathway Interactions
Making accurate inference for gene regulatory networks, including inferring about pathway by pathway interactions, is an important and difficult task. Motivated by such genomic applications, we consider multiple testing for conditional dependence between subgroups of variables. Under a Gaussian graphical model framework, the problem is translated into simultaneous testing for a collection of su...
متن کاملEmpirical null and false discovery rate analysis in neuroimaging
Current strategies for thresholding statistical parametric maps in neuroimaging include control of the family-wise error rate, control of the false discovery rate (FDR) and thresholding of the posterior probability of a voxel being active given the data, the latter derived from a mixture model of active and inactive voxels. Correct inference using any of these criteria depends crucially on the ...
متن کاملFalse Discovery Rate Analysis of Brain Diffusion Direction Maps By
Diffusion tensor imaging (DTI) is a novel modality of magnetic resonance imaging that allows noninvasive mapping of the brain’s white matter. A particular map derived from DTI measurements is a map of water principal diffusion directions, which are proxies for neural fiber directions. We consider a study in which diffusion direction maps were acquired for two groups of subjects. The objective o...
متن کاملFalse Discovery Rate Analysis of Brain Diffusion Direction Maps
Diffusion tensor imaging (DTI) is a novel modality of magnetic resonance imaging that allows noninvasive mapping of the brain’s white matter. A particular map derived from DTI measurements is a map of water principal diffusion directions, which are proxies for neural fiber directions. We consider a study in which diffusion direction maps were acquired for two groups of subjects. The objective o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009